Potentially functional variants of ERRFI1 in hypoxia‐related genes predict survival of non‐small cell lung cancer patients

Abstract Background Hypoxia is often involved in tumor microenvironment, and the hypoxia‐induced signaling pathways play a key role in aggressive cancer phenotypes, including angiogenesis, immune evasion, and therapy resistance. However, it is unknown what role genetic variants in the hypoxia‐related genes play in survival of patients with non‐small cell lung cancer (NSCLC). Methods We evaluated the associations between 16,092 single‐nucleotide polymorphisms (SNPs) in 182 hypoxia‐related genes and survival outcomes of NSCLC patients. Data from the Prostate, Lung, Colorectal, and Ovarian (PLCO) Cancer Screening Trial were used as the discovery dataset, and the Harvard Lung Cancer Susceptibility (HLCS) Study served as the replication dataset. We also performed additional linkage disequilibrium analysis and a stepwise multivariable Cox proportional hazards regression analysis in the PLCO dataset. Results An independent SNP, ERRFI1 rs28624 A > C, was identified with an adjusted hazards ratio (HR) of 1.31 (95% CI = 1.14–1.51, p = 0.0001) for overall survival (OS). In further analyses, unfavorable genotypes AC and CC, compared with the AA genotype, were associated a worse OS (HR = 1.20, 95% CI = 1.03–1.39, p = 0.014) and disease‐specific survival (HR = 1.21, 95% CI = 1.04–1.42, p = 0.016). Further expression quantitative trait loci analysis indicated that ERRFI1 rs28624C genotypes were significantly associated with higher ERRFI1 mRNA expression levels in the whole blood. Additional analysis showed that high ERRFI1 mRNA expression levels were associated with a worse OS in patients with lung adenocarcinoma. Conclusion Our findings suggest that genetic variants in the hypoxia‐related gene ERRFI1 may modulate NSCLC survival, potentially through their effect on the gene expression.


| INTRODUCTION
Lung cancer is a pervasive malignancy significantly impacting human health, particularly holding the highest global mortality rate among cancer-related fatalities. 1 In 2023, there were approximately 238,340 newly diagnosed cases of lung cancer in the United States, resulting in about 127,070 fatalities, underscoring its profound impact on population health. 2 Non-small cell lung cancer (NSCLC) constitutes approximately 85% of all lung cancer diagnoses.Despite notable progress in diagnosis and treatment, the prognosis for NSCLC remains persistently unfavorable. 3Identifying additional factors associated with the prognosis of NSCLC patients becomes imperative for clinically managing those patients with a bleak survival.5][6] The discovery and characterization of functional SNPs not only enhance our comprehension of the genetic landscape but also present opportunities for translating this knowledge into more effective clinical management and therapeutic interventions.
Hypoxia, a condition characterized by insufficient oxygen supply to tissues, plays a pivotal role in the progression of various diseases, including cancer.It commonly occurs in solid tumors due to rapid cell proliferation, inadequate blood supply, and aberrant vasculature. 7In the presence of hypoxic pressure, a series of downstream pathways, including hypoxia-inducible factor (HIF), autophagy, energy metabolic pathways, and cell stress pathways, are activated, facilitating cellular response to hypoxic stress. 8,91][12][13][14] The tumor microenvironment in lung cancer often exhibits regions of hypoxia, and the hypoxia-induced signaling pathways play a key role in the aggressive nature of lung cancer, affecting angiogenesis, immune evasion, and resistance to cancer therapies.Hypoxia increases vascular endothelial growth factor production, promoting the growth of blood vessels to sustain tumor growth. 15Targeting this process has become a therapeutic strategy in treating lung cancer. 16,17Hypoxia in the tumor microenvironment can suppress the anti-tumor immune response by promoting the recruitment of immunosuppressive cells and inhibiting the activity of cytotoxic T cells. 18This immune evasion contributes to the ability of lung cancer cells to escape immune surveillance, facilitating tumor progression.Understanding the intricate relationship between hypoxia and survival of patients with lung cancer can lead to the development of novel clinical management and therapeutic approaches.
In recent years, biological pathway analysis methods using genotyping data from genome-wide association studies (GWAS) can reveal genetic variants within genes involved in cancer-related pathways.To date, no published studies have explored the effects of genetic variants in the hypoxia-related genes on survival of NSCLC patients.Therefore, we conducted the present study to investigate the associations between genetic variants in the hypoxiarelated genes and NSCLC survival, utilizing genotyping datasets from two GWAS studies previously published.

Further expression quantitative trait loci analysis indicated that ERRFI1
rs28624C genotypes were significantly associated with higher ERRFI1 mRNA expression levels in the whole blood.Additional analysis showed that high ERRFI1 mRNA expression levels were associated with a worse OS in patients with lung adenocarcinoma.

Conclusion:
Our findings suggest that genetic variants in the hypoxia-related gene ERRFI1 may modulate NSCLC survival, potentially through their effect on the gene expression.

K E Y W O R D S
ERRFI1, genome-wide association study, hypoxia, non-small cell lung cancer, overall survival, single-nucleotide polymorphism conducted by the National Cancer Institute (NCI).The PLCO trial involved 77,500 male and 77,500 female participants, aged 55-74 years, who were recruited from 10 medical centers in the United States between 1993 and 2011. 19Each participant was randomly assigned to either the intervention group receiving screening or the control group receiving the standard care.Baseline blood samples and demographic information were collected from all the participants.Over a 13-year follow-up period, meticulous records were maintained, covering the aspects of tumor diagnosis, histopathology, tumor staging, treatment modalities, and survival outcomes.Genotyping of whole blood genomic DNA was performed by using Illumina HumanHap240Sv1.0 and HumanHap550v3.0 platforms (dbGaP accession numbers: phs000093.v2.P2 and phs000336.v1.p1). 20,21For survival analysis, a total of 1185 NSCLC patients of Caucasian ethnicity, with comprehensive personal information, were identified as eligible for inclusion in the present study.All the participants provided a written informed consent, having authorized the use of their datasets in the PLCO trial.Furthermore, the approval for the collection and use of data and samples was granted by the institutional review board at each participating institution.
The replication dataset was from the Harvard University Lung Cancer Susceptibility (HLCS) study, which was initiated in 1992. 22This dataset provided the needed information from 984 Caucasian patients with histologically confirmed NSCLC to validate the significant SNPs identified in the discovery dataset.In the HLCS study, DNA extraction from patient blood samples was conducted using the Auto Pure Large Sample Nucleic Acid Purification System by QIAGEN (Venlo, Limburg, Netherlands).Genotyping was executed using the Illumina Humanhap610-Quad array, with subsequent imputation facilitated by the Minimac3 software, leveraging sequencing data from the 1000 Genomes Project. 23he approval for utilizing data from both datasets mentioned above, along with access to the dbGaP database (Project #6404), was granted by the Internal Review Board of Duke University School of Medicine (#Pro00054575) and the National Center for Biotechnology Information (NCBI).A detailed comparison between the PLCO trial (n = 1185) and the HLCS study (n = 984) is presented in Table S1.

| Gene and SNP selection
The Hallmark gene sets are curated based on their ability to capture well-defined biological states or processes, such as hypoxia, making them highly relevant for our study.We obtained the hypoxia-related genes from the Hallmark gene sets within the Molecular Signatures Database (MSigDB) using the keyword "hypoxia."To ensure the reliability of our analyses, we excluded 18 genes located on the X chromosome due to their unique inheritance patterns and potential for gender-specific effects, which might introduce confounding variables.We identified 182 genes as candidate genes for further analyses (Table S2).Subsequently, we utilized Minimac4 for imputation on the Michigan Imputation Server (https:// imput ation server.sph.umich.edu), employing the 1000 Genomes Project (phase 3) dataset of European ancestry as the reference panel.Next, we extracted all SNPs within ±2kb flanking regions of these genes, adhering to specific selection criteria, which included an r 2 value ≥0.3 (Figure S1), a minor allelic frequency (MAF) ≥0.05, a genotyping rate of ≥95%, and a Hardy-Weinberg equilibrium (HWE) p-Value of ≥1 × 10 −5 .As a result, a total of 2225 genotyped SNPs were collected from the PLCO dataset, along with 13,867 imputed SNPs for subsequent analyses.

| Statistical analysis
In the present study, the follow-up duration for survival analysis was defined in the PLCO dataset as the period from the date of NSCLC diagnosis of the patients to the date of the last follow-up or the date of death.The primary endpoint of the study was OS of the patients, with simultaneous analysis of disease-specific survival (DSS) in a similar manner.Initially, a single-locus analysis was conducted, followed by a multivariable Cox proportional hazards regression analysis.The Cox regression analysis involved adjustments for age, sex, smoking status, histopathology, tumor staging, chemotherapy, radiotherapy, surgery, and the first four principal components (Table S3), all under an additive genetic model.The objective was to assess the associations between each of 16,092 SNPs and survival of NSCLC patients in the discovery dataset.Considering that over 90% of SNPs were estimated to be in linkage disequilibrium (LD), a multiple testing correction was implemented using Bayesian false discovery probability (BFDP) with a cut-off value of 0.80.This correction aimed to reduce the likelihood of potential false-positive results as recommended. 24A prior probability of 0.10 was set to detect variant genotypes or minor allele-associated hazards ratio (HR) with an upper limit of 3.0, and a significance level of p < 0.05 was considered.
We then replicated the identified SNPs from the discovery dataset using a multivariable Cox regression model in the replication dataset.The results from both datasets were subsequently combined through an inverse varianceweighted meta-analysis.Inter-study heterogeneity was assessed using Cochran's Q-test and the heterogeneity statistic (I 2 ) to guide the model selection.In cases without heterogeneity (P het >0.100 and I 2 < 50%), we employed a fixed-effects model; otherwise, a random-effects model was used.To identify representative SNPs, we utilized online bioinformatics tools-specifically, RegulomeDB (http:// www.regul omedb.org/ ) and HaploReg v4.2 (https:// pubs.broad insti tute.org/mammals/haploreg/ haploreg.php)with criteria such as r 2 < 0.8 and multifunctionality.In the pursuit of independent SNPs associated with NSCLC survival, we constructed a stepwise multivariable Cox regression model to incorporate the first four principal components of genotype data from the discovery dataset, along with 54 SNPs from prior publications in addition to the adjustments for available demographic and clinical variables.The outcomes for the selected SNPs were also visually presented through Manhattan plots by using Haploview v4.1 and regional association plots generated by using Locus Zoom (http:// locus zoom.sph.umich.edu).
After adjusting for covariables in the discovery dataset with multiple comparison correction, we explored the associations of SNP genotypes with both NSCLC OS and DSS.Utilizing additive, dominant, and recessive models, we assessed survival-associated SNP genotypes, categorizing those with an HR > 1 and a significance level of p < 0.05 as unfavorable genotypes.To assess the effects of all genotypes, particularly unfavorable ones, of the identified SNP on survival, we further generated Kaplan-Meier curves to visualize the effects on survival probabilities.Chi-square Q-tests were employed to explore interactions among various clinical subgroups in stratified analyses, allowing control for potential confounders and potential heterogeneity.Subsequently, using the "survival" and "time ROC" packages in R software (version 4.2.1),we constructed receiver operating characteristic (ROC) curves.The area under the curve (AUC) was calculated to develop survival prediction models, evaluating the predictive ability of combined genotypes for NSCLC survival. 25To confirm the independent association of SNP with the corresponding mRNA expression levels, an expression quantitative trait loci (eQTL) analysis was conducted, in which linear regression models were applied to mRNA expression data from 373 individuals of European descent in the 1000 Genomes Project as well as from 670 whole blood samples and 515 normal lung tissues in the genotype-tissue expression (GTEx) project. 26,27he analysis of mRNA expression levels in the Cancer Genome Atlas (TCGA) database included a comparison of 109 paired non-small cell lung cancer tissues and adjacent normal tissues using the paired t-test to explore differences in mRNA expression levels between tumor and adjacent normal tissues. 28The Gene Expression Profiling Interactive Analysis 2 (GEPIA2) online database (http:// gepia2.cance r-pku.cn) integrates published gene expression data and survival information from 960 NSCLC samples collected from TCGA databases. 29aplan-Meier (KM) analysis of this database was also used to visualize the association between mRNA expression and NSCLC survival.All statistical analyses were performed using SAS software (version 9.4; SAS Institute, Cary, NC, USA), unless otherwise specified.

| Associations between SNPs in the hypoxia-related genes and NSCLC survival
The characteristics of 1185 NSCLC patients from the discovery dataset and 984 NSCLC patients from the replication dataset are detailed in Table S1.After implementing multiple testing correction with BFDP ≤0.8, a total of 630 SNPs were found to be associated with NSCLC OS at a significance level of p < 0.05.These identified SNPs underwent further replication using the replication HLCS dataset, yielding 49 SNPs within seven genes, which remained statistically significant.Notably, three SNPs (RORA rs922782, PLIN2 rs7867814, and NEDDL4 rs1160748) had been previously reported in other distinct pathways. 4-6Among the seven genes, two (PLIN2 and PRKCA) each featured only one associated SNP.Further LD analysis through Haploview software on the 47 SNPs resulted in eight SNPs from five genes as the tagger SNPs (Figure S2).Subsequently, these eight SNPs, along with the SNPs from PLIN2 and PRKCA, totaling 10 SNPs, underwent further functional prediction and a stepwise multivariable Cox regression analysis (Table S4).Among these, six SNPs, in the presence of 54 previously published SNPs, were subjected to a stepwise multivariable Cox regression analysis.
Because the unavailability of detailed genotyping data in the HLCS dataset, we opted for a stepwise Cox regression analysis to account for clinical variables only available in the PLCO dataset, in assessing the independence of SNP effects on NSCLC survival, with adjustment for an additional 54 previously reported survival-associated SNPs from the same dataset.Ultimately, among the unpublished SNPs, only rs28624 in ERRFI1 maintained its independent association with NSCLC OS (Table 1).The meta-analysis conducted on this ERRFI1 rs28624 SNP across both discovery and replication datasets produced consistent results, illustrating uniformity between the discovery and replication datasets without heterogeneity (Table 2).Furthermore, a summary of the selected SNPs is presented in the Manhattan plot (Figure S3) and the regional association plot (Figure S4).
As shown in Table 3, the carriers of the ERRFI1 rs28624 C allele had an elevated risk of death or a poorer survival in NSCLC patients (P trend = 0.006 for OS and P trend = 0.019 for DSS); these findings are also illustrated in Kaplan-Meier

| Stratified analyses of independent SNP associated with NSCLC survival in the PLCO dataset
To enhance the precision of survival estimation, we classified ERRFI1 rs28624 AC/CC as the unfavorable genotypes and conducted further stratified analyses using the PLCO dataset to investigate whether their effect on NSCLC survival was influenced by other factors such as age, sex, smoking status, histology, tumor stage, chemotherapy, radiotherapy, and surgery.The results indicated that neither the rs28624 AA group nor the rs28624 AC/CC unfavorable genotype significant interactions with any demographic and clinical covariables in NSCLC OS and DSS (all P inter > 0.05, Table S5).

| Time dependent AUC and ROC curve for independent SNP for the prediction of NSCLC survival
Then, we assessed predictive capability of ERRFI1 rs28624 for 5-year NSCLC survival by contrasting the AUCs of models incorporating clinical variables with that integrating the identified independent SNP.The ROC curves illustrated that the inclusion of the identified SNP in the Cox model with 5-year OS clinical variable borderline increased the AUC from 86.99% to 87.64% (p = 0.088, Figure S5B).Additionally, the incorporation of the identified SNP into the Cox model with 5-year DSS prediction also borderline raised the AUC from 86.70% to 87.47% (p = 0.063, Figures S5A,C,D).These findings indicated that the identified SNP alone did not statistically significantly improve the predictive performance of the models for 5-year survival.

| The eQTL analysis
To perform the eQTL analysis, we started with the analysis of RNA-Seq data derived from lymphoblastoid cell lines of 373 individuals of European descent participating in the 1000 Genomes Project.The results did not show a correlation between ERRFI1 rs28624 C allele and its corresponding mRNA expression levels across all three genetic models (Figures S6A-C).Following this initial analysis, we also performed an eQTL analysis utilizing data obtained from 515 normal lung tissues and 670 whole blood samples sourced from the GTEx project.This analysis revealed a statistically significant association between the rs28624 C allele and elevated expression levels of ERRFI1 in whole blood samples (p = 0.031, Figure 2E).Specifically, participants with the CC genotype had higher mRNA expression levels than with AA and AC genotypes.However, it is noteworthy that such an association was not observed in normal lung tissues (p = 0.835, Figure S6D).In analyzing functional relevance of the identified independent SNP, bioinformatics evaluations employing RegulomeDB and Haploreg revealed some functional implications associated with ERRFI1 rs28624 A>C.Specifically, the variant influences enhancer histone marks and DNAse sensitivity and also induces alterations in sequence motifs, as detailed in Table S4.

| Differential mRNA expression analysis and survival of NSCLC
To further explore potential roles of the independent SNP in the survival of NSCLC, we first performed an analysis of ERRFI1 mRNA expression levels sourced from the TCGA database, which included 109 pairs of tumor samples (58 lung adenocarcinoma and 51 lung squamous cell carcinoma) and their corresponding adjacent normal tissues.Subsequently, we evaluated the associations between gene mRNA expression levels and the survival of NSCLC patients utilizing the GEPIA2 database.As depicted in Figure 3A-C, ERRFI1 mRNA expression levels were significantly elevated in the combined data for both lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC) tissues (p = 0.006) as well as for LUAD tissues (p = 0.003), but not for LUSC tissues (p = 0.956).However, higher mRNA expression levels of ERRFI1 were associated with a worse OS in the combined LUAD and LUSC group (p = 0.0047, Figure 3D), consistent with the findings in LUAD patients (p = 0.044, Figure 3E), but not in LUSC patients (p = 0.14, Figure 3F).
F I G U R E 3 Differential mRNA expression analysis and survival of ERRFI1 in the TCGA and GEPIA dataset.In TCGA database, relative to normal tissues, elevated ERRFI1 mRNA levels were observed in (A) both overall LUAD and LUSC tissues, as well as in (B) isolated LUAD tissues, but no significant difference was observed in (C) LUSC tissues.In the GEPIA database, increased ERRFI1 mRNA levels were correlated with worse survival outcomes in (D) both overall LUAD and LUSC patients, as well as in (E) isolated LUAD patients, but no significant difference in (F) LUSC patients.ERRFI1, ERBB Receptor Feedback Inhibitor 1; GEPIA, Gene Expression Profiling Interactive Analysis; LUAD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma; TCGA, The Cancer Genome Atlas.
In the present study, we tested the hypothesis that SNPs in the hypoxia-related genes are associated with the survival of NSCLC using two publicly available GWAS databases.Through multivariable Cox analysis, we identified an independent SNP, ERRFI1 rs28624 A>C, which was significantly associated with worse OS and DSS in American Caucasians.Subsequent eQTL analysis revealed a correlation between the rs28624 C variant allele and higher ERRFI1 mRNA expression levels in whole blood samples sourced from the GTEx project.Moreover, ERRFI1 mRNA expression levels significantly increased in paired LUAD tissues but not in paired LUSC tissue.The observed differences in ERRFI1 mRNA levels between LUAD and LUSC may be attributed to the inherent biological heterogeneity characterizing these distinct subtypes of NSCLC.Consistently, higher mRNA expression levels of ERRFI1 were associated with a poorer OS in patients with LUAD, while no significant association was observed in patients with LUSC.Overall, these findings suggest that genetic variants in the hypoxia-related gene ERRFI1 are associated with the survival of NSCLC, particularly in patients with LUAD, likely through alterations in the gene expression.
The ERRFI1 (ERBB receptor feedback inhibitor 1), also known as MIG-6 (mitogen-inducible gene 6 protein), Gene-33, and RALT (receptor-associated late transducer), is situated on chromosome 1p36.It plays a crucial role in cellular signal transduction. 30[33] Its primary function is the negative regulation of ERBB family receptors, including the epidermal growth factor receptor (EGFR) and other related receptors. 305][36] However, its role in breast cancer progression appears to be context-dependent, exhibiting both pro-tumorigenic and anti-tumorigenic effects in different types and stages of breast cancer progression. 37,38owever, ERRFI1 possesses a dual mechanism for inhibiting the EGFR signaling, involving a direct binding to EGFR, leading to the inhibition of EGFR catalytic activity, as well as directing lysosomal degradation of EGFR. 39,40EGFR-tyrosine kinase Inhibitors (EGFR-TKIs) have become crucial in the treatment of NSCLC with EGFR activating mutations, which are relatively rare in LUSC (approximately 4%). 41Adenocarcinoma, a predominant histological subtype of NSCLC, often features mutations in EGFR.In our analysis of the GEPIA2 data, high expression levels of ERRFI1 were associated with a poor survival in patients with LUAC.However, in a mouse model of LUAC driven by mutant EGFR, the absence of ERRFI1 accelerates both the onset and progression of the tumour. 42In contrast, another preclinical study suggested that suppression ERRFI1 overcomes acquired EGFR-TKIs resistance in LUAD. 43Furthermore, patients with a lower ERRFI1/EGFR ratio exhibited elevated response rates to EGFR-TKI, along with significantly extended progression-free survival. 44In NSCLC cellular models, tumors with a low ERRFI1/miR200c ratio showed a higher sensitivity to EGFR-TKI. 45Hypoxiainduced upregulation of ERRFI1 results in dormancy and resistance to EGFR-TKI in primary cultured NSCLC cells with EGFR mutations. 45Overall, these results suggest that elevated ERRFI1 expression levels may be associated with suboptimal therapeutic efficacy in the EGFR-TKI treatment.Clinical data also showed that patients with high ERRFI1 expression levels had a poor prognosis after the EGFR-TKI treatment in LUAC with the EGFR mutation. 45Similarly, another study suggested that high ERRFI1 expression levels were positively correlated with a poor prognosis and EGFR-TKI resistance in patients with LUAC with EGFR mutation. 43Furthermore, Maity et al. 42 demonstrated that although ERRFI1 deficiency reduced mouse survival due to an accelerated tumorigenesis, mutant EGFR can partially circumvent inhibition by ERRFI1 in LUAC cells through phosphorylation of ERRFI1 on a key residue.Under EGFR mutation conditions and prolonged exposure to EGFR-TKIs, phosphorylated ERRFI1 ceases to operate as a tumor suppressor; instead, it actively promotes the survival of cancer cells. 43n summary, the present is the first, to the best of our knowledge, to focus on the associations of potentially functional genetic variants in the hypoxia-related genes with the survival of NSCLC patients.By utilizing two previously published GWAS datasets, we identified a potentially functional SNP in ERRFI1, which may serve as a biomarker for survival in NSCLC by a potential mechanism of modulating the expression of the gene.To validate these findings and to decipher the intricate molecular mechanisms involved, further experimental investigations are warranted.While the present study provides valuable insights, it is important to acknowledge its limitations.First, our analysis was based on GWAS datasets from populations of Caucasian descent, raising questions about the generalizability to other ethnic groups.Additionally, the two GWAS databases may inherently be constrained by limited availability of treatment details and EGFR mutation status, which hinder further in-depth analysis.Moreover, although we identified a significant survival-associated SNP, our study did not delve into the detailed functional consequences of this genetic variant at the molecular level.Lastly, our analysis was confined to the available data, potentially leading to the omission of other relevant confounding factors or interactions that may have played a role.Nevertheless, our study contributes to the understanding of the role that genetic variants of hypoxia-related genes may play in NSCLC survival, offering potential insights for future investigations.

ACKNOWLEDGMENTS
The authors thank all the participants of the PLCO Cancer Screening Trial.The authors also thank the National Cancer Institute for granting access to the data collected during the PLCO trial.It is important to note that the statements provided in this document solely reflect the perspectives of the authors and do not imply concurrence or endorsement by the National Cancer Institute.The authors would also like to acknowledge the dbGaP repository for providing valuable cancer genotyping datasets.The accession numbers for the lung cancer datasets are phs000336.v1.p1 and phs000093.v2.p2.A list of contributing investigators and funding agencies for those studies can be found in the Supplemental Data.

F I G U R E 2
Prediction of survival with genotypes and eQTL analysis for SNP in ERRFI1.Kaplan-Meier survival curves for OS in the PLCO dataset for (A) each genotype and (B) the combined unfavorable genotypes; Kaplan-Meier survival curves for DSS in the PLCO dataset for (C) each genotype and (D) the combined unfavorable genotypes.(E) ERRFI1 rs28624_C allele was associated with higher mRNA expression of ERRFI1 in whole blood in GTEx project.eQTL, expression quantitative trait; ERRFI1, ERBB Receptor Feedback Inhibitor 1; PLCO, The Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial; SNPs, single nucleotide polymorphisms.#Unfavorable genotypes were ERRFI1 rs28624 AC/CC.
The independent SNPs in multivariate Cox proportional hazards regression analysis with adjustment for other covariates and 54 previous published SNPs in the PLCO dataset.Associations of the independent SNP with overall survival in both discovery and validation datasets from two previously published NSCLC GWASs.
Associations of ERRFI1 rs28624 A>C with survival of NSCLC in the PLCO study.
T A B L E 3Abbreviations: CI, confidence interval; DSS, disease-specific survival; HR, hazards ratio; NSCLC, non-small cell lung cancer; OS, overall survival; PLCO, Prostate, Lung, Colorectal and Ovarian cancer screening trial; SNP, single nucleotide polymorphism.aAdjustsex, smoking status, histology, tumor stage, chemotherapy, surgery, radiotherapy, and principal components.b11missing date were ed for age, excluded.F I G U R E 1The flowchart of the present study.ERRFI1, ERBB Receptor Feedback Inhibitor 1; GWAS, Genome-